Randomized phase II study of preoperative afatinib in untreated head and neck cancers: predictive and pharmacodynamic biomarkers of activity

There is no strong and reliable predictive biomarker in head and neck squamous cell carcinoma (HNSCC) for EGFR inhibitors. We aimed to identify predictive and pharmacodynamic biomarkers of efficacy of afatinib, a pan-HER tyrosine kinase inhibitor, in a window-of-opportunity trial (NCT01415674). Multi-omics analyses were carried out on pre-treatment biopsy and surgical specimen for biological assessment of afatinib activity. Sixty-one treatment-naïve and operable HNSCC patients were randomised to afatinib 40 mg/day for 21–28 days versus no treatment. Afatinib produced a high rate of metabolic response. Responders had a higher expression of pERK1/2 (P = 0.02) and lower expressions of pHER4 (P = 0.03) and pRB1 (P = 0.002) in pre-treatment biopsy compared to non-responders. At the cellular level, responders displayed an enrichment of tumor-infiltrating B cells under afatinib (P = 0.02). At the molecular level, NF-kappa B signaling was over-represented among upregulated genes in non-responders (P < 0.001; FDR = 0.01). Although exploratory, phosphoproteomics-based biomarkers deserve further investigations as predictors of afatinib efficacy.


Exploratory DNA sequencing (DNAseq) analyses
DNAseq was performed on baseline tumor biopsies from 56 patients, including 37 patients in Arm A and 19 patients in Arm B. Among the 37 patients in Arm A, 35 were evaluable for FDG-positron emission tomography (FDG-PET) response (22 responders and 13 non-responders; Supplementary Fig. 1).

Predictive biomarkers of metabolic response to afatinib
Among the 35 patients evaluable for FDG-PET response in Arm A, none of the genomic alterations nor altered molecular pathways were associated with the metabolic response (Supplementary Table 2).
We next focused on the top 10% of patients with the highest tumor mutational burden (TMB) scores corresponding to a TMB > 15.5 mut/Mb.There was no difference in OS between patients with high TMB (> 15.5 mut/ Mb) and low TMB (≤ 15.5 mut/Mb), with an HR of 0.9 (95% CI, 0.22-4.0;Supplementary Fig. 6).

Exploratory RNA sequencing (RNAseq) analyses
RNAseq was performed on baseline tumor biopsies from 53 patients, including 35 patients in Arm A and 18 patients in Arm B. Among the 35 patients in Arm A, 34 were evaluable for FDG-PET response (19 responders and 15 non-responders).Among them, 26 patients (15 responders and 11 non-responders) had both pre-and post-afatinib tumor samples with available gene expression data analysis (Supplementary Fig. 1).

Cellular level characterization of pharmacodynamic biomarkers
We quantified the numbers and the types of tumor microenvironment (TME) cells among the 26 patients evaluable for FDG-PET response in Arm A with contributive paired pre-and post-afatinib RNAseq.We applied a deconvolution method on bulk gene expression data, and reported the TME cell fraction changes between preand post-afatinib tumor biopsies according to afatinib metabolic response on FDG-PET scan (Fig. 3).We further computed ratios of post-afatinib/pre-afatinib fractions (or differences in case of null denominator) for each type of TME cells, in responders (N = 15) and non-responders (N = 11; Table 3).We showed higher ratios for B cells in responder than in non-responder patients [median (range) ratio, 1.8 (0.9-5.9) versus 0.8 (0.3-5.3); P = 0.02].In contrast, macrophages enrichment was numerically higher in non-responders compared to responders, albeit not statistically significant [median (range) ratio, 1.6 (0.7-3.3) versus 1.1 (0.6-5.4); P = 0.2].

Exploratory reverse phase protein arrays (RPPA) analyses
Protein extracts from baseline tumor biopsies were retrieved from 42 patients (26 patients in Arm A and 16 patients in Arm B) and were evaluated by RPPA methods by using a panel of 77 antibodies.Among the 26 patients in Arm A, 25 were evaluable for FDG-PET response (14 responders and 11 non-responders; Supplementary Fig. 1).www.nature.com/scientificreports/

Predictive biomarkers of metabolic response to afatinib
In the 25 patients evaluable for FDG-PET response in Arm A, we analyzed the relative expressions of phosphoproteins/proteins in pre-afatinib tumor biopsies according to metabolic response (Supplementary Fig. 9).We showed higher phosphorylation levels of p44/p42 MAPK (ERK1/2) in responders than in non-responders [median (range) ratio, 1.

Discussion
Short course of afatinib in untreated, operable, non-metastatic HNSCC patients induced a high rate of partial metabolic responses and partial responses according to RECIST1.1 without delaying surgery.Baseline phosphorylation levels of RB1, ERK1/2, and HER4 correlated with metabolic responses to afatinib.At the cellular level, we showed a significant enrichment of tumor-infiltrating B cells under afatinib in responders, while at the molecular level, the NF-kappa B signaling was found to be enriched among upregulated genes in non-responders.In posthoc analyses, we reported the negative prognostic values of CCND1 amplification and CDKN2A/B codeletion although these results were not confirmed in multivariate analyses.Seven percent of the patients achieved a partial response according to RECIST1.1, and 59% achieved a partial metabolic response as compared with 22% and 70% in a similar study run by the EORTC 28 .The lower response rates observed in our trial might be explained (1) by the higher proportion of patients with advanced disease at diagnosis (51% versus 33% of T4 stage tumors) and ( 2) by a difference in metabolic tumor response assessments, since in the EORTC study the last dose of afatinib was given strictly two hours before the FDG-PET scan before surgery.
Safety and tolerability were as expected regarding the toxicity profile of afatinib.Up to 12% of patients in Arm A stopped afatinib early because of toxicity, which impact the results of translational analyses because of the lack of sufficient drug exposure.Importantly, all reported AEs were manageable and did not delay the planned definitive surgery, the latter representing a major concern with window-of-opportunity trials 29 .
In comparison with the TCGA cohort 30 , cell cycle regulatory genes were less frequently altered in our study, with lower rates of CDKN2A alteration (38% versus 50%) and CDKN2A/B codeletion (16% versus 27%).This might be explained by the fact that we applied a more stringent variant selection algorithm when taking into account copy number deletions.The negative prognostic impact of CDKN2A (that encodes p15) alteration is well established in HNSCC 31,32 , as opposed to CDKN2B (that encodes p16) homozygous deletion.We found that www.nature.com/scientificreports/CDKN2A/B codeletion but not CDKN2A mutation alone correlated with poor survival outcomes, suggesting a key role of both p15 and p16 losses in determining the prognosis of CDKN2A/B co-deleted HNSCC.We identified higher metabolic response to afatinib in patients with low pre-treatment HER4 activity.This result is not in line with what has been observed in preclinical model.A xenograft model of HNSCC identified a HER4-activating oncogenic mutation leading to increased HER4 phosphorylation, which suggested that it may activate other HERs as a heterodimer and predict sensitivity to afatinib 33 .These results highlight the controversial role of HER4 in cancers, which may act as a tumor suppressor protein or as an oncoprotein 34 .Our findings also support the hypothesis that constitutive activation of ERK1/2 (based on assessments of functional T202 and Y204 phosphosites) may serve as predictive biomarker of activity of afatinib.ERK1/2 induces the transcription and the translation of cyclin D1, thus promoting G1/S progression through cyclin D1-CDK4/6-mediated phosphorylation of RB1 35 .Unexpectedly, we reported lower relative expressions of phosphoproteins/proteins for RB1 in responder patients than in non-responder patients, suggesting other mechanisms of cell cycle activation besides CDK4/6-mediated phosphorylation of RB1.
At the cellular level, pharmacodynamic analyses on the TME elucidated B cells changes in response to afatinib which may rely on the (re)activation of a tumor-targeting immune response 36 .In contrast to B cells, macrophages enrichment upon afatinib was found to be higher in non-responders compared to responders, although the difference was not statistically significant (P = 0.2).A number of clinical studies in NSCLC have shown that the degree of infiltration of tumor-associated macrophages positively correlated with disease progression and resistance to EGFR-TKIs 37,38 .In addition, a syngeneic murine model of EGFR-mutant lung tumor recently highlighted the immunosuppressive effects of tumor-associated macrophages on T cells CD8 + , thus impairing the efficacy of osimertinib (a third-generation EGFR-TKI) 39 .At the molecular level, Over Representation Analysis findings suggested mechanisms of resistance to afatinib associated with tumor invasion and metastasis through NF-kappa-B signaling enrichment 40,41 , a molecular feature of the 42-gene Chung's high-risk signature 42 .
Our study has several limitations.First, mean tumor reduction was lower than anticipated and correlative biomarkers analyses were based instead on metabolic response, which was not a predefined primary endpoint.In addition, no adjustment for multi-test were carried out, thus rendering biomarkers analyses mainly exploratory.Second, the specificity of FDG-PET scan to predict afatinib efficacy is unknown.In some pre-clinical and clinical studies, however, metabolic responses to targeted therapies have been shown to correlate with residual tumor cellularity, tumor shrinkage, and improved time to progression [43][44][45] .Third, we did not confirm the biomarkers findings from the LUX-Head and Neck 1 trial 27 , which demonstrated a higher efficacy of afatinib in p16-negative, EGFR-amplified, HER3-low, and PTEN-high HNSCC patients.This can be due to the intrinsic nature of window-of-opportunity trial, in which patients are exposed to drugs during a short period of time, impairing the onset of potential acquired drug resistance.Fourth, recent data underscored the importance of tertiary lymphoid structure in TME and their interplay with B cells in eliciting antitumor responses 46,47 .We did not implement spatial information and functional orientation of tumor-infiltrating immune cells and related B cell phenotypes and functions.In this regard, our understanding of the immune contexture is limited.Fifth, our www.nature.com/scientificreports/patient cohort was heterogeneous and encompassed various primary tumor sites, histologic grades, HPV status, and pre-treatment T-and N-stages, which may have impaired biomarkers discovery.Overall, we found that short course of preoperative afatinib induced a high rate of metabolic response without delaying the planned definitive surgery in treatment-naïve HNSCC patients selected for primary curative surgery.Baseline expression of pHER4, pERK1/2 and pRB1 together with B cells enrichment and NF-kappa B signaling activation correlated with metabolic response to afatinib.Although exploratory, these phosphoproteomics-based biomarkers deserve further investigations as predictors of afatinib efficacy.More mechanistic studies are needed to reliably establish the link between B cells enrichment, NF-kappa B signaling activation and sensitivity to HER family blockers in HNSCC.

Study objectives and endpoints
The main objective consisted in identifying predictive biomarkers of efficacy by exploring correlation between baseline potential biomarkers and radiological and metabolic responses to afatinib.Secondary objectives were to identify potential pharmacodynamic biomarkers, to evaluate the efficacy and safety of afatinib and to assess the metabolic and pathologic responses.We also did post-hoc analyses to evaluate the prognostic significance of recurrent genomic alterations and TMB in HNSCC.
The primary endpoint was the biological assessment of afatinib activity by performing translational researches in pre-treatment biopsy and surgical specimen.Translational researches included DNAseq, RNAseq and high throughput protein analysis using RPPA methods.
For secondary endpoints, efficacy was defined as the tumor size reduction between baseline and before surgery.The tumor size was the sum of two target lesions following measurement rules on CT scan/MRI of the head and neck according to RECIST1.1 48 .Metabolic response was measured on FDG-PET scan according to Positron Emission Tomography Response Criteria in Solid Tumours (PERCIST) 49 .Patients with a partial metabolic response were considered as responders, whereas patients with a stable or a progressive metabolic disease were considered as non-responders.The toxicity was assessed according to the Common Terminology Criteria for Adverse Events (CTCAE) version 4.0.Pathologic response was assessed on resected lymph nodes by the presence or not of invasive tumor.
For post-hoc analyses, OS was defined as the time from randomization until death from any cause or last follow-up.PFS was defined as the time from randomization to disease progression or death from any cause or last follow-up.

Study design and participants
The GEP11 PREDICTOR trial was an open-label, randomized, multicentric, controlled, phase II trial evaluating preoperative afatinib versus no treatment in patients with untreated, operable, non-metastatic T2-4 N0-2 (American Joint Committee on Cancer, 7th edition) HNSCC of the oral cavity, pharynx, and larynx.Randomization was stratified by primary tumor site (oropharynx versus non-oropharynx).All patients provided written informed consent before enrollment.The trial was approved by a national ethics committee.All methods were carried out in accordance with relevant guidelines and regulations.All experimental protocols were approved by UNICANCER.

Treatment and procedures
Patients were randomized (2:1) to receive oral afatinib 40 mg/day for 21 to 28 days (Arm A) or no treatment (Arm B) before surgery.Surgery was not delayed as compared to standard of care.Tumor biopsies were collected during baseline endoscopy and surgery.FDG-PET scans were performed before the endoscopy, and at day 15 after treatment initiation.FDG-PET scans examinations were performed on the same system at baseline and before surgery, and assessed by central imaging review (FDG-PET unit of the François Baclesse Cancer Center) as per the EANM guidelines 50 .CT scan/MRI were performed before the endoscopy and after the end of treatment within one week prior to surgery.

DNA sequencing
Samples were sequenced using a dedicated next-generation sequencing (NGS) panel targeting 571 genes (named "DRAGON").Genes were classified regarding the literature and databases (cBioPortal and OncoKB 51,52 ) in tumor suppressor genes (TSG), oncogenes, and genes considered as both an oncogene and a TSG.Genes were categorized according to the cellular pathway in which they were involved (Supplementary Table 7).

DNAseq protocol
Indexed paired-end libraries of tumor DNA were performed using the Agilent Sureselect XT2 library prep kit.The kit supports sequencing targeted regions of the genome spanning 2.7 Mb.About 50 ng of input DNA were used to build the libraries according to manufacturer's protocol.The pool was finally sequenced on a NovaSeq 6000 (Illumina) S2 × 150 bp flow cell.

Reads mapping
In the first analysis part, reads were mapped using BWA mem software (v0.7.15,Li, 2013) on the Human reference genome (hg19 assembly) using default parameters.As a second quality control, statistics regarding the mapping (percentage of aligned reads total and falling into the capture, percentage of PCR duplicates) and the capture coverage were produced using a combination of SAMtools, flagstat, BEDtools coverage and PicardTools MarkDuplicates.

Coverage quality control
A more detailed notion of each gene percentage, per barcode, covered by at least 100×, in the processed alignments, was also provided using a combination of awk, SAMtools mpileup, BEDtools intersect, multiinter and merge.Bases covered by less than 100× were reported per barcode using the same strategy.Genes belonging to patient pathology were tagged in these two files to facilitate the search for genes of interest that might be badly covered.

Variant selection algorithm
Stringent selection algorithm was applied to remove a maximum of irrelevant variants.We considered a minimal allelic ratio of 5% and a maximal frequency in the population of 0.1%.Only truncating mutations (frameshift deletion and insertion, stopgain, splicing alteration and hotspot mutations from Cancer Hotspot database) with a minimal coverage of 200 reads were retained for tumor suppressor gene variants.All missense variants known to be hotspot mutations from Cancer Hotspot database and no minimal coverage were retained for oncogenes variants 55 .For genes classified as both oncogenes and TSG (such as NOTCH1) or with known missense hotspots (like TP53), truncating mutations with a minimal coverage of 200, and known hotspots mutations with no minimal coverage were selected.

Oncoprints
Oncoprints were drawn using the ComplexHeatmap package and were performed with the Maftools package for 4.00 version of R.
Tumor mutational burden TMB was defined as the number of non-synonymous somatic mutations (SNVs and small indels) per mega-base in coding regions (mut/Mb).Coding variants (except for intronic splicing ones, therefore exons-only which represent 1.59 Mb), without synonymous nor polymorphisms (> 0.1% minor allele frequency [MAF]) and recurrent variants covered enough (not tagged as Low_Depth) were considered in all those calculations.Because the median and range of mutational load have been shown to vary across tumor types, we focused on the top 10% patients with the highest TMB scores and determined the log-rank p value for difference in survival among the 10% top TMB and 90% bottom TMB subgroups of patients.

RNA sequencing
3′ Tag-Seq protocol High-throughput 3′ Tag RNAseq was performed on frozen and formalin fixed and paraffin embedded (FFPE) tumor RNA (100 ng).We used the QuantSeq 3' mRNA-Seq Library Prep Kit (for Illumina) from Lexogen company.In contrast to traditional RNA-seq, which generates sequencing libraries for the whole transcript, 3′ Tag-Seq only generates a single initial library molecule per transcript, complementary to 3′ end sequences.

Data processing
RNAseq data was analyzed using the bioinformatics pipeline developed in Institut Curie and available at https:// github.com/ bioin fo-pf-curie/ RNAseq.Briefly, raw sequencing reads were checked for quality, trimmed for potential adapter sequences and aligned on the Human reference genome with the STAR software.State-of-the-art quality controls such as library complexity, alignment statistics, duplication level, reads annotation were performed in order to ensure a high-quality of RNAseq samples.Gene expression quantification was performed with the featureCounts tool on the coding genes annotation extracted from the Gencode project.

Bioinformatics analyses
Unsupervised clustering k-sparse methods 56 with number of cluster (k) equal to two of bulk-tissue RNAseq data was used.A batch effect-correction algorithm (https:// github.com/ Jfort in1/ ComBa tHarm oniza tion) was then applied in an attempt to mitigate the dataset technical heterogeneity.The ComBat method, initially described in genomics, is a realignment method used to deal with the batch effect.In genomics, the batch effect represents individual variations due only to variations between technicians, laboratories, or the days of manipulations.www.nature.com/scientificreports/

Identification of differentially expressed genes
We computed ratios of surgery/biopsy normalized gene expression levels as measured by RNAseq, with "biopsy" referring to treatment-naïve tumor samples, and "surgery" the paired post-afatinib ones.The Mann-Whitney U test was performed to identify variability of expression ratios between two response (responders versus nonresponders) groups.A cut-off of p-value < 0.05 was applied to select the most differentially expressed genes in the context of therapy.

Pathway enrichment analyses
Over Representation Analysis (ORA) was used for enrichment analysis 57 .Pathway enrichment was computed using the Kyoto Encyclopedia of Genes (KEGG) database (https:// www.kegg.jp/ kegg/ kegg1.html) with hypergeometric test false discovery rate ≤ 0.1.Network-based pathway enrichment analyses were performed using ratios of surgery/biopsy gene expression levels across responders and non-responders in the bulk-tissue RNAseq data.In the bulk-tissue, the differentially expressed genes in the context of therapy that had a p-value < 0.05 were selected as input for pathway analyses.ORA was performed separately by the direction of change in gene expression in order to identify gene sets that are over-represented when we only consider genes that are up-or down-regulated in one condition relative to another.KEGG datasets were retrieved from the R package software clusterProfiler v4.5.0 58 .

Deconvolution of tumor microenvironment immune cells
The R package software quanTIseq 59 was applied to bulk RNAseq data from paired pre-and post-tumor samples to quantify the fractions of ten immune cell types, including B cells, M1 and M2 macrophages, monocytes, neutrophils, natural killer cells, non-regulatory CD4+ T cells, CD8+ T cells, T reg cells and myeloid dendritic cells.quanTIseq also estimated the proportion of uncharacterized cells, namely cells that were present in the cell mixture of interest but were not represented in the signature matrix (e.g., cancer cells).Importantly, quanTIseq scores are proportional to the amount of each cell population in the total sample, thus allowing intra-and intersample comparison for each population.

High throughput protein analysis
High throughput protein analysis used RPPA methods (Supplementary Fig. 10).The samples were processed as previously described 60 and printed onto nitrocellulose covered slides (Supernova, Grace Biolabs) using a dedicated arrayer (2,470 arrayer, Aushon Biosystems).Five serial dilutions, starting at 2000 mg/ml and two technical replicates per dilution were printed for each sample.Arrays were labelled with 77 specific, or without primary antibody (as negative control), as previously described 60 .All primary antibodies used in RPPA have been previously tested by Western Blotting to assess their specificity for the protein of interest.Raw data were normalized using Normacurve 61 , which normalizes for fluorescent background per spot, a total protein stain and potential spatial bias on the slide.Next, each RPPA slide was median centered and scaled (divided by median absolute deviation).We then corrected for remaining sample loading effects individually for each array by correcting the dependency of the data for individual arrays on the median value of each sample, over all 77 arrays, using a linear regression.The panel covered post translationally modified cancer pathway proteins, including phosphorylated and non-phosphorylated proteins derived from cell cycle, cell migration, genome integrity, immunity, metabolism, JAK/STAT signaling, MAPK signaling, NF-kappa B signaling, PI3K/AKT/mTOR signaling, receptor tyrosine kinase signaling, and Wnt/β-catenin signaling pathways (Supplementary Table 8).

Statistical analyses
The sample size was calculated to have a 90% power to detect a difference in treatment effect between biomarker positive versus biomarker negative patients.The treatment effect was defined as the difference in the tumor size evolution between the two arms.Since the observed mean tumor reduction under afatinib was less than the anticipated threshold of 20% 62 , correlative biomarkers analyses were done on the basis of the metabolic response and thus remained mainly exploratory.
All statistical analyses were performed using R software.Comparisons between groups were assessed using the Wilcoxon rank-sum test for quantitative variables, and the Fisher test for qualitative variables.For post-hoc analyses, univariate and multivariate Cox proportional hazard models were performed to identify prognostic biomarkers.The results are presented as hazard ratios (HR) and 95% confidence intervals (CIs).Survival curves were obtained with Kaplan-Meier estimates and compared using the log-rank test.The level of significance was set at P < 0.05.Correction for multiple testing was not applied.

Figure 3 .
Figure 3. Tumor microenvironment cell fraction changes between pre-and post-afatinib tumor biopsies in responder (N = 14) and non-responder patients (N = 11) according to the metabolic response on FDG-PET scan by PERCIST.Tumor microenvironment cells included B cells (A), macrophages (B), (C), neutrophils (D), NK cells (E), T cells CD4+ (F) and T cells CD8+ (G).Uncharacterized cells, which were representative of cells outside immune subtypes (including tumor cells), were also reported (H).

Figure 4 .
Figure 4. Gene sets which were found to be over-represented among up-regulated genes (N = 267) in nonresponder patients, as determined by Over Representation Analysis KEGG Encyclopedia Genes datasets, FDR false discovery rate.

Table 1 .
Patient characteristics.a American Joint Committee on Cancer, 7th edition staging.

Table 3 .
Variation of fraction of tumor-infiltrating immune cells under afatinib according to metabolic response on FDG-PET scan by PERCIST.NS not significant.$ Uncharacterized cells are representative of cells outside immune including tumor cells.¶ Median fractions were computed in B cells, macrophages, NK T cells CD4+ and CD8+, and uncharacterized cells; median immune cell fractions derived by subtracting the post-afatinib cell fraction from the pre-afatinib cell fraction were computed in monocytes, neutrophils, and T cells CD8+.Significant values are in bold.Vol:.(1234567890)Scientific Reports | (2023) 13:22524 | https://doi.org/10.1038/s41598-023-49887-4